Yutong Xie

IRIS: time-structured manifold projections

May 29, 2026
Via arXiv

Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

May 20, 2026
Via arXiv

Towards Physically Consistent 4D Scene Reconstruction for Closed-loop Autonomous Driving Simulation

May 20, 2026
Via arXiv

CA-GCL: Cross-Anatomy Global-Local Contrastive Learning for Robust 3D Medical Image Understanding

May 13, 2026
Via arXiv

Unveiling Deepfakes: A Frequency-Aware Triple Branch Network for Deepfake Detection

Apr 19, 2026
Via arXiv

See Fair, Speak Truth: Equitable Attention Improves Grounding and Reduces Hallucination in Vision-Language Alignment

Apr 10, 2026
Via arXiv

Beyond the Global Scores: Fine-Grained Token Grounding as a Robust Detector of LVLM Hallucinations

Apr 06, 2026
Via arXiv

MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage

Mar 24, 2026
Via arXiv

AURORA: Adaptive Unified Representation for Robust Ultrasound Analysis

Mar 19, 2026
Via arXiv

ClinCoT: Clinical-Aware Visual Chain-of-Thought for Medical Vision Language Models

Mar 01, 2026
Via arXiv